Smart Paper Notes

Causal Inference under Outcome-Based Sampling with Monotonicity Assumptions

Sung Jae Jun , Sokbae Lee

Category: Machine Learning (Statistics)

2020-04-17

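We study causal inference under case-control and case-population sampling. For this purpose, we focus on the binary-outcome and binary-treatment case, where the parameters of interest are the causal relative risk and the attributable risk, defined via the potential outcome framework. It is shown that strong ignorability is not always as powerful as it is under random sampling, and that certain monotonicity assumptions yield comparable results in terms of sharp identified intervals. Specifically, the usual odds ratio is shown to be a sharp identified upper bound on the causal relative risk under the monotone treatment response and monotone treatment selection assumptions. We then discuss averaging the conditional (log) odds ratio and propose an algorithm for semiparametrically efficient estimation when the averaging is based only on the (conditional) distributions of the covariates that are identified in the data. We also offer algorithms for causal inference if the true population distribution of the covariates is desired for aggregation. We show the usefulness of our approach by studying two empirical examples from social science: the benefit of attending private school for entering a prestigious university in Pakistan, and the causal relationship between staying in school and getting involved with drug-trafficking gangs in Brazil.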
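
To make the role of the odds ratio concrete, here is a minimal sketch that computes the sample odds ratio (and its log) from a 2x2 case-control table. The function name and the counts are hypothetical illustrations, not taken from the paper, and the code does not implement the paper's semiparametrically efficient estimator. Per the abstract, under the monotone treatment response and monotone treatment selection assumptions this quantity is a sharp upper bound on the causal relative risk.

```python
import math


def odds_ratio(treated_cases, untreated_cases, treated_controls, untreated_controls):
    """Sample odds ratio from a 2x2 table of counts.

    Cases are units with outcome Y = 1, controls are units with Y = 0.
    The argument names are hypothetical; all counts must be positive.
    """
    odds_among_cases = treated_cases / untreated_cases
    odds_among_controls = treated_controls / untreated_controls
    return odds_among_cases / odds_among_controls


# Hypothetical case-control sample: 100 cases and 100 controls.
ratio = odds_ratio(treated_cases=40, untreated_cases=60,
                   treated_controls=20, untreated_controls=80)
log_ratio = math.log(ratio)
print(ratio, log_ratio)
```

The odds ratio is symmetric in treatment and outcome, so it can be computed from the treatment odds within cases and within controls, which is exactly what an outcome-based sample provides.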

translated by Google Translate

L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training

Jonghyun Bae , Woohyeon Baek , Tae Jun Ham , Jae W. Lee

Category: Computer Vision

2022-08-18

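The training process of deep neural networks (DNNs) is usually pipelined, with stages for data preparation on CPUs followed by gradient computation on accelerators such as GPUs. In an ideal pipeline, the end-to-end training throughput is ultimately limited by the throughput of the accelerator, not by that of data preparation. In the past, the DNN training pipeline achieved near-optimal throughput by using datasets encoded with a lightweight, lossy image format such as JPEG. However, as high-resolution, losslessly encoded datasets become more popular for applications requiring high accuracy, a performance problem arises in the data preparation stage due to low-throughput image decoding on the CPU. We therefore propose L3, a custom lightweight, lossless image format for high-resolution, high-throughput DNN training. The decoding process of L3 is effectively parallelized on the accelerator, thus minimizing CPU intervention for data preparation during DNN training. L3 achieves 9.29x higher data preparation throughput than PNG, the most popular lossless image format, for the Cityscapes dataset on an NVIDIA A100 GPU, which leads to 1.71x higher end-to-end training throughput. Compared to JPEG and WebP, two popular lossy image formats, L3 provides up to 1.77x and 2.87x higher end-to-end training throughput for ImageNet, respectively, at equivalent metric performance.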

Learning from Data with Noisy Labels Using Temporal Self-Ensemble

Jun Ho Lee , Jae Soon Baik , Tae Hwan Hwang , Jun Won Choi

Category: Computer Vision

2022-07-21

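Real-world datasets inevitably contain many mislabeled samples. Because deep neural networks (DNNs) have an enormous capacity to memorize noisy labels, a robust training scheme is required to prevent labeling errors from degrading the generalization performance of DNNs. Current state-of-the-art methods use a co-training scheme that trains dual networks using samples associated with small losses. In practice, however, training two networks simultaneously can burden computing resources. In this study, we propose a simple yet effective robust training scheme that operates by training only a single network. During training, the proposed method generates a temporal self-ensemble by sampling intermediate network parameters from the weight trajectory formed by stochastic gradient descent optimization. The sum of the losses evaluated with these self-ensembles is used to identify incorrectly labeled samples. In parallel, our method generates multi-view predictions by transforming the input data into various forms and considers their agreement to identify incorrectly labeled samples. By combining these two metrics, we present the self-ensemble-based robust training (SRT) method, which filters out samples with noisy labels to reduce their influence on training. Experiments on widely used public datasets demonstrate that the proposed method achieves state-of-the-art performance in some categories without training dual networks.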
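
The loss-sum criterion can be sketched roughly as follows, assuming that class probabilities from K intermediate checkpoints are already available as a NumPy array. The function names, the fixed `keep_ratio` selection rule, and the toy numbers are assumptions made for illustration; the abstract does not specify the selection rule, and the multi-view agreement term is not shown.

```python
import numpy as np


def per_sample_cross_entropy(probs, labels, eps=1e-12):
    """Cross-entropy of each sample; probs is (N, C), labels is (N,)."""
    picked = probs[np.arange(len(labels)), labels]
    return -np.log(picked + eps)


def select_small_loss(checkpoint_probs, labels, keep_ratio):
    """Keep the samples whose loss, summed over checkpoints, is smallest.

    checkpoint_probs is (K, N, C): predictions of K intermediate networks
    sampled along the SGD trajectory (the temporal self-ensemble).
    keep_ratio is a hypothetical hyperparameter, not taken from the paper.
    """
    loss_sum = np.sum(
        [per_sample_cross_entropy(p, labels) for p in checkpoint_probs], axis=0
    )
    n_keep = int(round(keep_ratio * len(labels)))
    keep_idx = np.sort(np.argsort(loss_sum)[:n_keep])
    return keep_idx, loss_sum


# Toy example: two checkpoints, four samples, two classes.
# Sample 2 carries label 0 although both checkpoints predict class 1.
probs = np.array([
    [[0.9, 0.1], [0.2, 0.8], [0.1, 0.9], [0.3, 0.7]],
    [[0.8, 0.2], [0.1, 0.9], [0.2, 0.8], [0.4, 0.6]],
])
labels = np.array([0, 1, 0, 1])
keep_idx, loss_sum = select_small_loss(probs, labels, keep_ratio=0.75)
print(keep_idx, loss_sum)
```

Summing over several checkpoints rather than using one network's loss is what makes this a self-ensemble criterion: a sample has to look suspicious along the trajectory, not just at a single point in training.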

Efficient and Privacy Preserving Group Signature for Federated Learning

Sneha Kanchan , Jae Won Jang , Jun Yong Yoon , Bong Jun Choi

Category: Machine Learning

2022-07-12

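Federated learning (FL) is a machine learning (ML) technique that aims to reduce threats to user data privacy. Training is done using the raw data on the users' devices, called clients, and only the training results, called gradients, are sent to the server to be aggregated into an updated model. However, we cannot assume that the server can be trusted with private information, such as metadata related to the owner or the source of the data. Hiding client information from the server therefore helps reduce privacy-related attacks. The privacy of the client's identity, along with the privacy of the client's data, is thus necessary to make such attacks more difficult. This paper proposes an efficient and privacy-preserving protocol for FL based on group signatures. A new group signature scheme for federated learning, called GSFL, is designed not only to protect the privacy of the client's data and identity but also to significantly reduce the computation and communication costs, taking into account the iterative process of federated learning. We show that GSFL outperforms existing approaches in terms of computation, communication, and signaling costs. We also show that the proposed protocol can handle various security attacks in the federated learning environment.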

ST-CoNAL: Consistency-Based Acquisition Criterion Using Temporal Self-Ensemble for Active Learning

Jae Soon Baik , In Young Yoon , Jun Won Choi

Category: Computer Vision | Machine Learning

2022-07-05

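Modern deep learning has achieved great success in various fields. However, it requires labeling huge amounts of data, which is expensive and labor-intensive. Active learning (AL), which identifies the most informative samples to be labeled, is becoming increasingly important for maximizing the efficiency of the training process. Existing AL methods mostly use only a single final fixed model to acquire the samples to be labeled. This strategy may not be good enough, because the structural uncertainty of a model for the given training data is not considered when acquiring samples. In this study, we propose a novel acquisition criterion based on a temporal self-ensemble generated by conventional stochastic gradient descent (SGD) optimization. These self-ensemble models are obtained by capturing the intermediate network weights produced over the SGD iterations. Our acquisition function relies on a consistency measure between the student and teacher models. A fixed number of temporal self-ensemble models serve as the student models, and the teacher model is constructed by averaging the student models. Using the proposed acquisition criterion, we present an AL algorithm, namely student-teacher consistency-based AL (ST-CoNAL). Experiments on image classification tasks with the CIFAR-10, CIFAR-100, Caltech-256, and Tiny ImageNet datasets demonstrate that the proposed ST-CoNAL achieves significantly better performance than existing acquisition methods. Furthermore, extensive experiments show the robustness and effectiveness of our method.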
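
A consistency-based acquisition step of this kind might be sketched as follows. This is only an illustration under stated assumptions: the predictions of the student and teacher models are taken as given arrays, the inconsistency is measured with a mean KL divergence (the abstract does not name the measure), and the function names and toy numbers are hypothetical.

```python
import numpy as np


def kl_divergence(p, q, eps=1e-12):
    """Row-wise KL(p || q) for arrays of class probabilities, shape (N, C)."""
    return np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)


def acquire(student_probs, teacher_probs, budget):
    """Pick the unlabeled samples on which students and teacher disagree most.

    student_probs is (K, N, C), teacher_probs is (N, C). The KL-based
    score is an assumed stand-in for the paper's consistency measure.
    """
    scores = np.mean(
        [kl_divergence(teacher_probs, s) for s in student_probs], axis=0
    )
    chosen = np.argsort(-scores)[:budget]
    return chosen, scores


# Toy example: two students, three unlabeled samples, two classes.
teacher = np.array([[0.9, 0.1], [0.5, 0.5], [0.6, 0.4]])
students = np.array([
    [[0.9, 0.1], [0.9, 0.1], [0.6, 0.4]],
    [[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]],
])
chosen, scores = acquire(students, teacher, budget=1)
print(chosen, scores)
```

In the toy data the two students contradict each other on the second sample, so that sample receives the highest score and is the one sent for labeling.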

DBN-Mix: Training Dual Branch Network Using Bilateral Mixup Augmentation for Long-Tailed Visual Recognition

Jae Soon Baik , In Young Yoon , Jun Won Choi

Category: Computer Vision | Machine Learning

2022-07-05

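There is growing interest in the challenging visual perception task of learning from long-tailed class distributions. The extreme class imbalance in the training dataset biases the model toward recognizing majority-class data over minority-class data. Recently, the dual branch network (DBN) framework has been proposed, in which two branch networks, the conventional branch and the re-balancing branch, are employed to improve the accuracy of long-tailed visual recognition. The re-balancing branch uses a reverse sampler to generate class-balanced training samples and thereby mitigate the bias caused by class imbalance. Although this strategy has been quite successful in handling bias, training with a reverse sampler can degrade representation learning performance. To alleviate this issue, the conventional method used a carefully designed cumulative learning strategy, in which the influence of the re-balancing branch gradually increases over the training phase. In this study, we aim to develop a simple yet effective method that improves the performance of DBN without cumulative learning, which is difficult to optimize. We devise a simple data augmentation method termed bilateral mixup augmentation, which combines one sample from the uniform sampler with another sample from the reverse sampler to produce a training sample. Furthermore, we present class-conditional temperature scaling, which mitigates the bias toward the majority class for the proposed DBN architecture. Our experiments on widely used long-tailed visual recognition datasets show that bilateral mixup augmentation is quite effective in improving the representation learning performance of DBNs, and that the proposed method achieves state-of-the-art performance in some categories.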
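
The idea of mixing a uniformly drawn sample with a reverse-sampled one can be sketched as below. The details are assumptions for illustration: the reverse sampler draws classes with probability inversely proportional to class frequency, the mixing coefficient follows a Beta(alpha, alpha) distribution as in standard mixup, and all function names and data are hypothetical rather than the authors' implementation.

```python
import numpy as np


def reverse_sampling_probs(labels, num_classes):
    """Per-sample probabilities for a reverse sampler.

    Classes are drawn with probability inversely proportional to their
    frequency, then a sample is drawn uniformly within the class.
    Assumes every class has at least one sample.
    """
    counts = np.bincount(labels, minlength=num_classes).astype(float)
    class_weight = 1.0 / counts
    class_prob = class_weight / class_weight.sum()
    return class_prob[labels] / counts[labels]


def bilateral_mixup(x, labels, num_classes, rng, alpha=1.0):
    """Mix one uniformly sampled example with one reverse-sampled example."""
    n = len(labels)
    i = rng.integers(n)  # uniform sampler
    j = rng.choice(n, p=reverse_sampling_probs(labels, num_classes))  # reverse sampler
    lam = rng.beta(alpha, alpha)  # assumed mixup coefficient distribution
    one_hot = np.eye(num_classes)
    x_mix = lam * x[i] + (1.0 - lam) * x[j]
    y_mix = lam * one_hot[labels[i]] + (1.0 - lam) * one_hot[labels[j]]
    return x_mix, y_mix


# Toy long-tailed data: 8 samples of class 0 and 2 samples of class 1.
labels = np.array([0] * 8 + [1] * 2)
features = np.arange(20, dtype=float).reshape(10, 2)
rng = np.random.default_rng(0)
x_mix, y_mix = bilateral_mixup(features, labels, num_classes=2, rng=rng)
print(reverse_sampling_probs(labels, 2), x_mix, y_mix)
```

With this toy split the reverse sampler picks the minority class 80% of the time, so most mixed samples blend a typical (majority) example with a rare one.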

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

Jack Hessel , Jena D. Hwang , Jae Sung Park , Rowan Zellers , Chandra Bhagavatula , Anna Rohrbach , Kate Saenko , Yejin Choi

Category: Computer Vision | Natural Language Processing

2022-02-10

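Humans have a remarkable capacity to reason abductively and to hypothesize about what lies beyond the literal content of an image. By identifying concrete visual clues scattered throughout a scene, we almost cannot help but draw probable inferences based on our everyday experience and knowledge about the world. For example, if we see a "20 mph" sign alongside a road, we might assume the street is in a residential area (rather than on a highway), even if no houses are pictured. Can machines perform similar visual reasoning? We present Sherlock, an annotated corpus of 103K images for testing machine capacity for abductive reasoning beyond literal image contents. We adopt a free-viewing paradigm: participants first observe and identify salient clues within images (e.g., objects, actions) and then, given the clue, provide a plausible inference about the scene. In total, we collect 363K (clue, inference) pairs, which form a first-of-its-kind abductive visual reasoning dataset. Using our corpus, we test three complementary axes of abductive reasoning. We evaluate the capacity of models to: i) retrieve relevant inferences from a large candidate corpus; ii) localize evidence for inferences via bounding boxes; and iii) compare plausible inferences to match human judgments on a newly collected diagnostic corpus of 19K Likert-scale judgments. While we find that fine-tuned CLIP-RN50x64 with a multitask objective outperforms strong baselines, significant headroom exists between model performance and human agreement. Data, models, and leaderboard are available at http://visualabduction.com/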

Further Improving Weakly-supervised Object Localization via Causal Knowledge Distillation

Feifei Shao , Yawei Luo , Shengjian Wu , Qiyi Li , Fei Gao , Yi Yang , Jun Xiao

Category: Computer Vision

2023-01-03

Weakly-supervised object localization aims to indicate the category as well as the extent of an object in an image given only image-level labels. Most existing works are based on Class Activation Mapping (CAM) and endeavor to enlarge the discriminative area inside the activation map to perceive the whole object, yet they ignore the co-occurrence confounder of object and context (e.g., fish and water), which makes it hard for the model to distinguish object boundaries. Besides, the use of CAM also brings a dilemma: classification and localization always suffer from a performance gap and cannot reach their highest accuracy simultaneously. In this paper, we propose a causal knowledge distillation method, dubbed KD-CI-CAM, to address these two under-explored issues in one go. More specifically, we tackle the co-occurrence context confounder problem via causal intervention (CI), which explores the causalities among image features, contexts, and categories to eliminate the biased object-context entanglement in the class activation maps. Based on the de-biased object feature, we additionally propose a multi-teacher causal distillation framework to balance the absorption of classification knowledge and localization knowledge during model training. Extensive experiments on several benchmarks demonstrate the effectiveness of KD-CI-CAM in learning clear object boundaries from confounding contexts and in addressing the dilemma between classification and localization performance.

Surveillance Face Anti-spoofing

Hao Fang , Ajian Liu , Jun Wan , Sergio Escalera , Chenxu Zhao , Xu Zhang , Stan Z. Li , Zhen Lei

Category: Computer Vision

2023-01-03

Face Anti-spoofing (FAS) is essential to secure face recognition systems against various physical attacks. However, recent research generally focuses on short-distance applications (e.g., phone unlocking) and gives little consideration to long-distance scenes (e.g., surveillance security checks). To promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks. In these scenes, low image resolution and noise interference are new challenges for surveillance FAS. Together with the SuHiFiMask dataset, we propose a Contrastive Quality-Invariance Learning (CQIL) network to alleviate the performance degradation caused by image quality from three aspects: (1) An Image Quality Variable module (IQV) is introduced to recover image information associated with discrimination by incorporating a super-resolution network. (2) Generated sample pairs are used to simulate quality variance distributions, helping contrastive learning strategies obtain robust feature representations under quality variation. (3) A Separate Quality Network (SQN) is designed to learn discriminative features independent of image quality. Finally, extensive experiments verify the quality of the SuHiFiMask dataset and the superiority of the proposed CQIL.

Edge Enhanced Image Style Transfer via Transformers

Chiyu Zhang , Jun Yang , Zaiyan Dai , Peng Cao

Category: Computer Vision

2023-01-02

In recent years, arbitrary image style transfer has attracted increasing attention. Given a pair of content and style images, the goal is a stylized image that retains the content of the former while capturing the style patterns of the latter. However, it is difficult to maintain a good trade-off between content details and style features. When an image is stylized with sufficient style patterns, the content details may be damaged, and sometimes the objects in the image cannot be clearly distinguished. For this reason, we present a new transformer-based method named STT for image style transfer, together with an edge loss that noticeably enhances content details and avoids the blurred results caused by excessive rendering of style features. Qualitative and quantitative experiments demonstrate that STT achieves performance comparable to state-of-the-art image style transfer methods while alleviating the content leak problem.